Covariate Powered Cross-Weighted Multiple Testing
نویسندگان
چکیده
A fundamental task in the analysis of datasets with many variables is screening for associations. This can be cast as a multiple testing task, where objective achieving high detection power while controlling type I error. We consider $m$ hypothesis tests represented by pairs $((P_i, X_i))_{1\leq i \leq m}$ p-values $P_i$ and covariates $X_i$, such that $P_i \perp X_i$ if $H_i$ null. Here, we show how to use information potentially available about heterogeneities among hypotheses increase compared conventional procedures only $P_i$. To this end, upgrade existing weighted through Independent Hypothesis Weighting (IHW) framework data-driven weights are calculated function covariates. Finite sample guarantees, e.g., false discovery rate (FDR) control, derived from cross-weighting, data-splitting approach enables learning weight-covariate without overfitting long partitioned into independent folds, arbitrary within-fold dependence. IHW has increased methods do not covariate information. key implication rejection common setups should proceed according ranking p-values, but an alternative implied covariate-weighted p-values.
منابع مشابه
Importance-Weighted Cross-Validation for Covariate Shift
A common assumption in supervised learning is that the input points in the training set follow the same probability distribution that the input points used for testing follow. However, this assumption is not satisfied, for example, when the outside of training region is inter/extrapolated. The situation where the training input points and test input points follow different distributions is call...
متن کاملCovariate Shift Adaptation by Importance Weighted Cross Validation
A common assumption in supervised learning is that the input points in the training set follow the same probability distribution as the input points that will be given in the future test phase. However, this assumption is not satisfied, for example, when the outside of the training region is extrapolated. The situation where the training input points and test input points follow different distr...
متن کاملWeighted multiple testing correction for correlated tests.
Virtually all clinical trials collect multiple endpoints that are usually correlated. Many methods have been proposed to control the family-wise type I error rate (FWER), but these methods often disregard the correlation among the endpoints, such as the commonly used Bonferroni correction, Holm procedure, Wiens' Bonferroni fixed-sequence (BFS) procedure and its extension, and the alpha-exhausti...
متن کاملWeighted False Discovery Rate Control in Large-Scale Multiple Testing
The use of weights provides an effective strategy to incorporate prior domain knowledge in large-scale inference. This paper studies weighted multiple testing in a decisiontheoretic framework. We develop oracle and data-driven procedures that aim to maximize the expected number of true positives subject to a constraint on the weighted false discovery rate. The asymptotic validity and optimality...
متن کاملPartial Knowledge in Multiple-Choice Testing
The intent of this study was to discover the nature of (partial) knowledge as estimated by the multiple-choice (MC) test method. An MC test of vocabulary, including 20 items, was given to 10 participants. Each examinee was required to think aloud while focusing on each item before and while making a response. After each test taker was done with each item, s/he was ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of The Royal Statistical Society Series B-statistical Methodology
سال: 2021
ISSN: ['1467-9868', '1369-7412']
DOI: https://doi.org/10.1111/rssb.12411